29-April-2024
Cool stuff happening in Trenton
https://www.meetup.com/trenton-makes-tech-_/
FLaNK / KNIFe AI Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup groups: NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
*This is Issue #135*
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
New Releases
Articles
https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa
https://medium.com/@tspann/searching-slack-from-apache-nifi-9ed562aa2397
https://medium.com/@tspann/events-streams-flows-and-maps-22a8d27cd9b4
https://medium.com/@tspann/storing-meetup-user-data-as-events-dad3b1dc89f5
https://medium.com/@tspann/real-time-in-boston-part-1-0f92d7da3496
https://thenewstack.io/apache-nifi-2-0-0-building-python-processors/
https://medium.com/plain-simple-software/the-llm-app-stack-2024-eac28b9dc1e7
https://blog.cloudera.com/climate-and-sustainability-hackathon-meet-the-judges/
https://huggingface.co/blog/vlms
https://haystack.deepset.ai/blog/chatting-with-sql-databases-3-ways
https://www.denoise.digital/llama-3-get-started-with-llms/
https://www.pinecone.io/learn/chunking-strategies/
https://www.pinecone.io/blog/canopy-rag-framework/
Picking an Embedding Model
https://www.pinecone.io/learn/series/rag/embedding-models-rundown/
https://huggingface.co/spaces/mteb/leaderboard
Retrieval Augmented Generation Assessment (RAGAS)
Metrics-Driven Agent Development
https://www.pinecone.io/learn/series/rag/ragas/
https://engineering.grab.com/data-observability
JSON Lines (JSONL)
https://jsonlines.org/
https://www.timeplus.com/post/real-time-ai-oss-tools
https://www.jetson-ai-lab.com/research.html#meeting-schedule
https://www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product
https://zilliz.com/learn/pandas-dataframe-chunking-anf-vectorizing-with-milvus
Videos
Search Slack
https://www.youtube.com/watch?v=3ugppfb2kN8&t=5s&ab_channel=DatainMotion-HowToBeaStreamingEngineer
MBTA Transit Live with LLM
https://www.youtube.com/watch?v=JGGY_uzQTdY&t=3s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D
Events, Streams, Maps with Irish Rail
https://www.youtube.com/watch?v=14CSQRfUWoE&t=684s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D
Building Real-Time Pipelines
XTremeJ
https://www.youtube.com/watch?v=SszeF57IdW4
Adding Generative AI to Real-Time Streaming Pipelines | Tim Spann | Conf42 LLMs 2024
https://www.youtube.com/watch?v=Yeua8NlzQ3Y
MLConf NYC 2022
https://www.youtube.com/watch?v=Vw-jlU8STBk
Summer School Data Science Festival
https://www.youtube.com/watch?v=0G98z_fs_SQ
ScyllaDB Summit 2023
https://www.youtube.com/watch?v=ZwhoosP1UWU
https://www.youtube.com/watch?v=-_52DIIOsCE&ab_channel=JamesBriggs
https://www.youtube.com/watch?v=XFZ-rQ8eeR8
Slides
https://www.slideshare.net/slideshow/april-2024-nlit-cloudera-realtime-llm-streaming-2024/267269851
https://www.slideshare.net/slideshow/realtime-ai-streaming-ai-max-princeton/267269816
https://www.slideshare.net/slideshow/conf42llmadding-generative-ai-to-realtime-streaming-pipelines/267269788
Events
May 1, 2024: Gen AI in the Enterprise Cloud. Virtual.
https://www.linkedin.com/events/7180985346103410688/comments/
https://lu.ma/q7pcfyjn
May 8-9, 2024: Data Summit 2024. Boston, MA.
https://www.dbta.com/DataSummit/2024/default.aspx
https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx
https://twitter.com/DBTADataSummit/status/1778393005646397636
May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.
June 12, 2024: Budapest Data + ML Forum. Virtual.
https://budapestml.hu/2024/en/speakers/
June 20, 2024: AI Camp Meetup. NYC.
Sept 24, 2024: JConf.Dev. Dallas.
https://2024.jconf.dev/session/598816
Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara.
https://www.developerweek.com/cloudx/
Nov 19, 2024: XtremePython. Online.
https://xtremepython.dev/2024/
Cloudera Events
https://www.cloudera.com/about/events.html
More Events:
https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
Code
- https://github.com/tspannhw/FLaNK-python-processors
- https://github.com/tspannhw/FLANKAI-Boston
- https://github.com/tspannhw/FLaNK-IrelandTransit
Models
- https://huggingface.co/papers/2404.14219
- https://github.com/haizelabs/llama3-jailbreak
- https://huggingface.co/microsoft/Phi-3-mini-128k-instruct
- https://huggingface.co/microsoft/Phi-3-mini-128k-instruct-onnx
- https://github.com/google/maxtext
- https://github.com/apple/corenet
- https://openbmb.vercel.app/minicpm-v-2-en
- https://github.com/OpenBMB/MiniCPM-V
- https://developer.nvidia.com/blog/new-standard-for-speech-recognition-and-translation-from-the-nvidia-nemo-canary-model
- https://huggingface.co/Snowflake/snowflake-arctic-instruct
- https://starling.cs.berkeley.edu/
- https://github.com/QwenLM/Qwen1.5
Tools
- https://learn.deeplearning.ai/courses/getting-started-with-mistral/lesson/1/introduction
- https://github.com/picosh/pico
- https://github.com/kylebarron/parquet-wasm
- https://wasmer.io/posts/py2wasm-a-python-to-wasm-compiler
- https://github.com/ezrosent/frawk
- https://demos.littleworkshop.fr/infinitown
- https://dos.itch.io/fsdoom
- https://healeycodes.github.io/doom-checkboxes/
- https://github.com/sensebox/openSenseMap-API
- https://github.com/discord/discord-example-app
- https://pypi.org/project/gdown/
- https://pypi.org/project/PyPDF2/
- https://johnzon.apache.org/index.html
- https://ionutbalosin.com/2024/03/analyzing-jvm-energy-consumption-for-jdk-21-an-empirical-study/
- https://github.com/rastapasta/mapscii
- https://github.com/langgenius/dify
- https://www.chronon.ai/
- https://aiindex.stanford.edu/report/
- https://github.com/WecoAI/aideml
- https://github.com/thomiceli/opengist
- https://hashquery.dev/
- https://github.com/google/paxml
- https://github.com/NVIDIA/Megatron-LM
- https://carapace.sh/
- https://github.com/karpathy/minGPT
- https://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca
- https://www.udio.com/
- https://github.com/latitude-dev/latitude
- https://github.com/pytorch/torchtune
- https://github.com/aiplanethub/beyondllm
- https://github.com/leftmove/cria
- https://harper.blog/2024/04/12/i-accidentally-built-a-meme-search-engine/
- https://github.com/mlfoundations/open_clip
- https://github.com/ml-explore/mlx-examples/tree/main/clip
- https://github.com/harperreed/mlx_clip
- https://github.com/youtube/api-samples
- https://www.intel.com/content/www/us/en/developer/tools/openvino-toolkit/download.html?VERSION=v_2024_1_0&OP_SYSTEM=MACOS&DISTRIBUTION=ARCHIVE
- https://github.com/AndroidIDEOfficial/AndroidIDE
- https://github.com/datafaker-net/datafaker
- https://pi4j.com/blog/2024/20240425_interview_tom_aarts/
- https://github.com/akalenuk/the_namingless_programming_language
- https://github.com/microsoft/MonitoFi
- https://github.com/earwig/mwparserfromhell
- https://docs.cohere.com/reference/customer-support
- https://github.com/pinecone-io/canopy
- https://github.com/pinecone-io/examples/blob/master/learn/generation/langchain/handbook/05-langchain-retrieval-augmentation.ipynb
- https://github.com/prompt-security/ps-fuzz
- https://github.com/dream-num/univer
- https://github.com/igorgorbenko/songs_recommendation/blob/main/data_preparation.ipynb
- https://www.xda-developers.com/back-up-your-raspberry-pi/
- https://github.com/OWASP/OFFAT
- https://github.com/truefoundry/cognita
- https://llm.extractum.io/static/blog/?id=llm-token-pricing#
Discount
Discount access to DataSummit 2024
https://secure.infotoday.com/RegForms/DataSummit/?Priority=24SPKR
© 2020-2024 Tim Spann